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(57) Abstract 

The invention relates to a 
protein stabilizing sequence parti- 
cularly useful for stabilization of 
proteolytically sensitive proteins. 
The sequence includes a relatively 
small number of amino adds that 
may be expressed fused with a pro- 
teolytically sensitive protein. The 
most effective stabilization se- 
quences assume a-helix structures 
with a hydrophobic face and a posi- 
tively charged polar face which ap- 
pear to require proper orientation 
with respect to each other. Other 
aspects of the invention include 
cloning vectors incorporating a 
gene sequence encoding the stabili- 
zation polypeptide and production 
of stabilized antigenic proteins. 
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DESCRIPTION 

SEQUENCE FOR STABILIZING PROTEINS IN BACTERIA 
PROTEIN STABILIZATION SEQUENCE 

5 

The United States Government may have certain rights 
in the present invention pursuant to the terms of Grant 
No. 86-13978 awarded by the National Science Foundation. 

10 

BACKGROUND OF THE INVENTION 
Field of the Invention 

15 The invention relates to an amino acid sequence 

useful for stabilization of otherwise unstable proteins, 
particularly proteolytically sensitive proteins. The 
invention also relates to methods of producing stabilized 
proteins by direct attachment of the stabilizing sequence 

20 or through fusion proteins expressed from recombinant 
host cells. 

Description of Related Art 

25 One of the major problems in the use and preparation 

of therapeutic and commercial proteins is degradation by 
cell proteases. This may occur in vivo in plasma or 
inside a host cell when recombinant methods of 
polypeptide production are employed. Degradation plays 

30 an obvious role in the elimination of damaged or abnormal 
proteins but also affects half lives of normal proteins. 
Individual protein turnover rates may vary 1000 fold, 
depending on environmental and structural factors. 

35 Two general factors contribute to the half -life of 

any given protein. One factor includes "global" features 
such as large size, hydrophobicity, thermal instability 
and charge characteristics. The other factor is related 
to sequence specific parameters of particular a-amino 
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terminii, asparagine residues and so-called PEST 
sequences. Although correlations have been made between 
particular features and susceptibility to degradation, 
explicit degradation signals and pathways are not well 
5 defined. It is not even known whether peptide bond 
cleavage or biochemical marking is the primary event 
signaling protein destruction. 

In contrast to degradation factors, there appear to 
10 be stabilization factors that enhance stability of a 

protein toward proteolysis. It has been suggested that 
both thermodynamic stability and proteolytic 
susceptibility are major determining factors in rate of 
degradation (Parsell and Sauer, 1989) . There has been 
15 some evidence that particular sequences at the N-terminus 
of lambda repressor in Escherichia coli aid in 
maintaining a stable tertiary structure and therefore 
slow proteolytic degradation (Parsell and Sauer, 1989) , 
Some evidence also suggests that the increased stability 
20 of certain fusion proteins may derive from the ability of 
a stable fusion partner such as protein G or 
0-galactosidase to stabilize the tertiary structure of a 
proteolytically sensitive protein (Hellebust et al., 
1989) . 



25 



30 



35 



A particular amino acid sequence attached as a 
"tail" to the c-terminal of Arc, a small dimeric 
DNA-binding protein found in bacteriophage P22, has been 
found to stabilize that protein against proteolysis. The 
primary sequence of the tail consists of 25 amino acids 
which when attached to a heterologous protein, the LP57 
mutant of X-repressor in Escherichia coli, also 
stabilized that protein CBowie and Sauer, 1989) . 

Nevertheless, up until now, an amino acid sequence 
attachable to a proteolytically sensitive protein has not 
been found which will generally increase stability of 
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these proteins. The value of such a sequence would lie 
in its ability to increase the half-life of commercially 
desired proteins for production in recombinant cell hosts 
and the potential to protect proteinaceous vaccines from 
5 serum protease degradation when injected directly into an 
animal. 



10 



15 



SUMMARY OF THE INVENTION 

The invention generally relates to a protein 
stabilization sequence. The sequence is comprised of 
amino acids that may be attached to proteolytically 
sensitive proteins. 



The stabilization sequence includes a limited number 
of amino acids ranging from about ten to about 50 
residues. The amino acids need not be confined to a 
particular type of amino acid; however, the amino acid 

20 residues will be such that the secondary and tertiary 
structure assumes the form of an outwardly directed, 
properly aligned hydrophobic face and a positively 
charged polar face. The importance of this structure has 
been shown by substituting two of the amino acids causing 

25 the structure to have a twisted form. This form was 
shown to be less stable. The alignment of amino acid 
groups within a particular a-helical form appears to be 
important in order for the amino acid sequence to have 
maximal effect as a stabilization sequence. 

30 

The a-helical structure assumed by the amino acid 
protecting sequence is an important aspect of the 
invention. Thus it is number and type of amino acid 
residues involved, and the structure assumed by the amino 
35 acid residues making up the structure. Twisted forms of 
the a-helix structure with the hydrophobic and polar 
faces out of alignment provide less stability to fused 
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heterologous proteins. Thus in preferred embodiments the 
outwardly directed hydrophobic face of the structure is 
in alignment with positively charged face of the o-helix. 

5 Although it is believed that such stabilization 

structures could be formed with as many as a 100 amino 
acid residues, the amino acid sequence used is preferably 
from about 10 to about 50 acid residues and more 
preferably 29 amino acid residues. A particularly 
10 preferred amino acid sequence is shown in Figure 1. 

Another aspect of the invention is the attachment of 
the protein stabilization sequence to a proteolytically 
sensitive enzyme. Such attachments may be made at the 

15 time of synthesis of the proteolytically sensitive 

protein of interest or alternatively may be used in in 
vitro situations where it is desired to protect a 
proteolytically sensitive protein from degradation. An 
example might include the use of the protecting sequence 

20 to protect an antigenic protein that is being used to 
promote antibody formation in vivo. Protection of the 
antigen from serum proteases would provide more effective 
use of expensive antigenic materials that are in short 
supply. 

25 

In a preferred embodiment, the protein stabilization 
sequence is prepared in the form of a cassette in which 
the DNA segment encoding the sequence is incorporated 
into a cloning vector. Proper construction of the vector 
30 results in a protein of interest being expressed with the 
polypeptide protecting group attached to the protein, 
thus protecting it from protease degradation within the 
host cell in which it is expressed. 



35 



Attachment of the protein stabilization sequence is 
generally to the amino-terminal end of the 
proteolytically sensitive protein or to the carboxy 
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terminal end. In some cases, for example, exceptionally 
long proteolytically sensitive proteins it may be 
desirable to attach the stabilization sequence to both 
termini. In vitro attachment methods may be readily 
5 accomplished using chemical attachments well known to 

those of skill in the art. Attachment of such sequences 
in vivo is possible by constructing appropriately 
designed cloning rectors. 

10 The protein stabilization sequence described has 

general utility and is not limited to stabilization of 
the polymerase sigma factor from which it was originally 
derived. RNA polymerase sigma factors from B. suJbtilis 
or E. coli have been prepared as fusion proteins with the 

15 stabilization factor originally isolated from B. suJbtilis 
sigma factor. It is envisioned that virtually any DNA 
segment encoding a polypeptide could be constructed in a 
cloning vector or expression vector with the DNA segment 
encoding the stabilization factor sequence. Examples of 

20 other proteolytically sensitive proteins that might be 
protected with the stabilization sequence include 
proinsulin, epidermal growth factor, inter leukin, 
interferon, somato tropin, insulin-like growth factor, 
phosphatase, immunoglobulin Pv and the like. Moreover 

25 expression may be from a variety of suitable bacterial 
cells provided that they can be suitably transformed by 
an appropriately constructed cloning vector. Examples of 
suitable host microorganisms include E. coli, B. 
suJbtilis, S* tymphyrium and the like. 

30 

The invention is also envisioned as a method of 
enhancing stability of recombinant proteins. A first 
step would be to obtain a gene segment encoding the 
protein stabilization sequence. Such a sequence is not 
35 limited to a particular number or type of amino acid 

residues provided that the residues form the appropriate 
structure having the stabilization properties* Thus 
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under certain conditions it may be desirable to encode up 
to 100 amino acids in the sequence. In a further step a 
recombinant vector is prepared. This vector will have a 
gene encoding the protein stabilization sequence selected 
5 as well as a gene encoding a protein desired to be 

stabilized. Generally, this is accomplished by preparing 
a cloning site within a plasmid vector that encodes the 
protein stabilization sequence, then inserting into that 
site a gene or a gene segment encoding the protein 

10 desired to be stabilized. The cloning site may be 

modified to include one or more unique restriction sites 
but in general will be constructed so that insertion of 
the gene of choice will restore a proper reading frame 
between the protein stabilization sequence and the 

15 cloning site through the desired gene segment. Such a 
recombinant vector may then be used to transform a host 
cell. Transformed host cell colonies will then be 
selected, for example by incorporating selected genes 
within the recombinant vectors such as antibiotic 

20 resistance genes or alkaline phosphatase expressing genes 
which will elicit positive or negative responses 
depending on whether the desired gene has been cloned 
into the cloning site, once transformed host cell 
colonies are identified, expression of a heterologous 

25 protein fused with the af oredescribed protein 

stabilization sequence may be affected. This is 
generally performed using standard culture methods well 
known to those of skill in the art. 

30 Expression of a wide variety of heterologous 

proteins fused with the protein stabilization sequences 
should be possible. In general proteolytically sensitive 
proteins suitable for production as stable heterologous 
proteins from prokaryotic hosts will be proteins that are 

35 expressible as active fusion proteins. Such proteins 
include 6-galactosidase, sigma factor and the like. 
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BRTRP DESCRIPTION OP TBS DRAWINGS 

Figure 1 shows an amino acid sequence capable of 
forming a protecting hydrophobic faced a-helix structure 
5 when attached to a proteolytically sensitive protein. 

Figure 2 shows the nucleic acid sequence encoding a 
29 amino acid protective sequence. 

10 Figure 3 illustrates generally the a-helix formed by 

amino acid sequences that confer proteolytic stability on 
proteins to which they are N-terminally attached. A 
hydrophobic face and a positively charged polar face are 
aligned on the a-helix. 

15 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Materials and Methods 

20 

Bacterial Strains and Plasmids 

B. svbtilis strains JH-642 (pheA trpC2) and 1A287 
(rpsl) were obtained from J. Hoch and the Bacillus 

25 Genetic Stock Center (Ohio State University), 

respectively. SR104 has an internal deletion in sigE. 
It was created by transformation of strain SMY with DNA 
from CM01 (trpC2 pheAl AsigE: :erm) . The M13 host strains 
CJ236 (dut iing) and JM103 (dut+ ung+) were obtained from 

30 V. Deretic (University of Texas Health Science Center at 
San Antonio) . Escherichia coli strain TB-1 was provided 
by D. Kolodrubetz (University of Texas Health Science 
Center at San Antonio) . Plasmids pSR-5 (spoIID: zlacZ) 
(Rong et al. 1986), pSI-1 (pSpac) (Yansura and Henner, 

35 1984), pGSIIGll (sigE) (Stragier et al., 1984), and 

pSGMU31 (Errington, 1986) have been described, pJM102, 
obtained from J. Hoch, is pUC18 (Yanisch-Perron et al., 
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1985) with, a chloramphenicol acetyltransf erase (cat: 950 
base pairs [bp]) gene cloned into its unique Ndel site, 
PSGM031A2 was derived from P SGMU31 by cutting the plasmid 
at its two BamHJ sites and single Bglll site and 
relegating the two major fragments. This results in loss 
of a small, nonessential BamffJ-BgJJJ fragment (150bp) and 
the creation of a single BamHJ site in the vector. 
PSGMU31A2 has the BamffX site and the lacZ gene of the 
original plasmid downstream of the plasmid 's lac 
promoter. pSR22 is pBR322 (Bolivar et al., 1977) and 
PUB110 (Gryczan et al. f 1978) joined at their BawHI sites 
plus a 1.1-kbp fragment encoding sigE (Stragier et al. 
1984) . The sigE fragment was cloned into the EcoRI site 
of pBR322 by using EcoRI linkers. pSR51 and pSR54 are 
variants of pSR22 with missense mutations in the coding 
sequence of sigE. 



ol iaonuci *>ot ide-D 5 i-^cted Mutagenesis 



20 



The 1.1-kpb PstI fragment of pGSIIGll was cloned 
into M13mpl9 (Yanisch-Perron et al. 1985), and its 
orientation within the vector was determined by 
restriction endonuclease analysis of RF DNA. To 
25 construct slgEA84, an oligonucleotide (60-mer) that 

carried the sequence 30 bases upstream of and including 
the ATG codon (nucleotides 179 to 181 of the 1.1-kpb 
fragment) of P 31 and 30 bases downstream of the o* amino 
terminus (nucleotides 266 to 268) was synthesized. The 
30 60-mer was hybridized to single-stranded M13 DNA 

containing its complementary sequence within the cloned 
1.1-kpb PstI fragment. The F»-specific sequence was 
"looped out" as a consequence of using DNA polymerase I 
to synthesize a complement to the circular DNA by using 
35 the oligonucleotide as primer. Enrichment for the 
deleted sequence was accomplished by Si nuclease 
treatment of the 84-base loop following the 
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polymerization reaction. The hybridization, 
polymerization, and SI treatments were accomplished as 
described by adelman et al. (Adelman et al. 1983). 
Following transformation of E. coli JM103, mutagenized 
5 clones were screened by restriction endonuclease analysis 
for reduction in the size of the cloned PstI fragment 
(i.e., 1.1 kbp to 1.02 kbp) and the loss of the Hindll 
site originally present in the loop region. 
Approximately 5% of the resulting clones had the desired 
10 mutation. 

Additional mutant clones sigEA81, sigEA48, sigEA252, 
and sigEA253 were constructed by using oligonucleotides 
of 30 bases (30-mers) (Kunkel, 1984). M13mpl8 containing 

15 either the original i.l-kbp sigE fragment for sigEA81 and 
sigEA84) or 1.02-kbp sigEA84 fragment (for sigEA252 and 
sigEA253) was plague purified twice on E. coli CJ236 (dut 
ung) and grown on this host to incorporate uracil into 
the phage DNA. Following hybridization to a 30-mer, 

20 Sequehase:2.0 (United States Biochemical Corp., 

Cleveland, Ohio) was used to synthesize the complementary 
strand. Selection for the strand polymerized in vitro 
was accomplished by transforming and plating the 
polymerization mixture on E. coli JM103 (dut+ ung+) , 

25 which selects against the uracil-containing template. 

From 50 to 90% of the clones arising from the reaction 
contained the desired mutations. These were identified 
by size changes in the cloned B. subtilis fragment and by 
DNA sequencing. 

30 

Strain Construction 

The E. coli lac gene was translationally coupled to 
sigE by cloning a 340-bp Pstl-SauJIIAl fragment of the 
35 1.1-kbp clones sigE gene into Pstl-BamHI-cut pSGMU31A2. 
This coupled the 5' end of lacZ in frame to sigE at a 
site 165 bp downstream of the sequence encoding the amino 
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terminus of P 31 . In a similar fashion, the 256-bp 
Pstl-SauIIIAl fragment from sigEA84 was cloned into this 
vector. This construct was formed at the same site 
within sigE but had only 81 bp of the sigE protein-coding 
5 region upstream of lacZ. JH642 was transformed with 

PSGMU31A2 containing the coding sequence for either the 
sigE+- or sigEA84-encoded amino terminus. Clones in 
which the plasmid had integrated into the B. subtilis 
chromosome were selected by chloramphenicol resistance 
10 (2.5 /ig/ml) and screened by Southern blot analysis for 
integration into the sigE locus with the proper sigE 
allele positioned upstream of lacZ. 

Strains of JH642 in which the wild-type sigE locus 
15 had been replaced by mutant sigE alleles were constructed 
by congression of the mutant alleles, cloned as a 1- to 
1.1-kbp Pstr fragment in JM102, with a streptomycin 
resistance (Str 1 ) marker from B. subtilis 1A287, Str* 
colonies were screened for Spo" by failure of the colonies 
20 to run brown on DS (Schaeffer et al., 1965) plates after 
24 h at 37 °C. Spo" clones were then screened for Cm* to 
identify those in which the mutant allele had entered the 
chromosome by gene replacement rather than integration of 
the entire plasmid. Chromosomal DNA from clones that met 
25 these criteria were screened by Southern blot analysis 
for the predicted gene replacement. 

^l ysis of Bx t ™^g for P 3 * and r^-like Proteins 



B. subtilis cultures were grown in DS medium, cells 
were harvested, and protein extracts were prepared as 
described previously (Trempy et al., 1985), except that 
the ammonium sulfate step was omitted. Protein samples 
(100 pg) (determined by the Coomassie method, Bio-Rad 
35 Laboratories) were precipitated with 2 volumes of cold 

ethanol, suspended in sample buffer, and fractionated on 
sodium dodecyl sulfate-polyacrylamide gels (12% 
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acrylamide). Following electrophoretic transfer to 
nitrocellulose and blocking of the nitrocellulose with 
Blotto, the protein bands were probed with an anti-P 31 /<x e 
monoclonal antibody (Trempy et al., 1985). Bound 
5 antibody was visualized by using a horseradish 

peroxidase-conjugated goat immunoglobulin against mouse 
immunoglobulin (Hyclone Laboratories, Inc.) or "'i-rabbit 
anti-mouse immunoglobulin antibody (K. Krolick, 
University of Texas Health Science Center at San 
10 Antonio) . 



fl-GalactofH dase Assays 

B. subtilis strains carrying either pSR5 or an 
15 integrated pSGMU31A2 plasmid were grown in DS medium and 
harvested at various times during growth and sporulation. 
Cells were disrupted by passage through a French pressure 
Cell (twice at 20,000 lb/ in 2 ) and analyzed for 
B-galactosidase activity as described by Miller (Miller, 
20 1972) . 



DNA Sequencing 



DNA sequencing was performed by the method of Sanger 
25 et al. (1977) with the Seguenase reagents (U.S. 

Biochemical Corp.) and the protocol provided by the 
manufacturer . 



30 



Construction of the sj rfEABA allele. 



The sigE gene was reconfigured to encode a a B -like 
protein as its direct product. A single-stranded 
oligonucleotide was hybridized to a region within a 
single-stranded copy of this fragment (cloned in 
35 . M13mpl9) . This resulted in the looping out of the DNA 
that encodes the precursor element of P 31 . A complement 
to the M13 -containing DNA was synthesized by using the 
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oligonucleotide as a primer. Following second-strand 
synthesis and destruction of the loop by Si nuclease, the 
DNA mixture was introduced into E. coli by 
transformation. Cloned DNAs which both lost the Bindll 
5 site that is unique to the precursor portion of the 

sequence and contained a PstI fragment of approximately 
1.020 bp (rather than the parental 1.100 bp) were picked 
as putative sigE mutants (sigT&BA) , expected to contain a 
sequence in which the region encoding the o* amino 
10 terminus is positioned immediately downstream of the P 31 
initiation codon and ribosome-binding site. This was 
verified by sequencing the splice regions of 
constructions that were positive by the restriction 
endonuclease criteria. 



The splice region of one of the clones diverged from 
the parental sequence only after the initiating ATG 
codon. At that point, the sequence encoding the a* amino 
terminus began in frame with the ATG codon. 



The l.02-kbp PstI fragment carrying the siglASA 
allele was cloned into unique PstI sites of the plasmid 
vectors pJM102 and pSI-1 to form pJMA84 and pSlA84, 
respectively. The PstI site of each of these plasmids 

25 lay downstream of an IPTG 

(isopropylthiogalactopyranoside) -inducible promoter) Plac 

in pJMl02 and Pspac in pSI-1) . Both plasmids replicated 

in E. coli, but only pSI-1 replicated in B. subtilis. 

The sigmsi gene was cloned in the proper orientation for 

30 expression from the inducible promoters as verified by 

restriction endonuclease analysis of the ability of this 

gene to specify a product in E. coli. 
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EXAMPLE I 

Production of Siama Prote in in Escherichia coli 

Cultures of E. coli which carried either the mutant 
or wild-type sigE allele under Plac or Pspac control were 
induced with IPTG. The amount of P 31 or 0^84 present at 2 
h after induction was determined by Western immunoblot 
analysis. An autoradiogram of the results of this 
experiment is presented in Fig. 3. Significant amounts 
of protein that reacted with the anti-P 31 /^ antibody were 
present in all of the extracts. The extracts from 
strains carrying the sigrEA84 allele synthesized a protein 
(a B A84) with the apparent mobility of a E , while the 
wild-type allele specified a protein with the mobility of 
p 31 . In addition to the a E A84 band, extracts prepared from 
the sigEA84 strains contained one or more prominent 
lower-molecular-weight proteins that reacted with the 
antibody. The lower-molecular-weight proteins varied in 
abundance in different extract preparations and probably 
represent breakdown products of a*. They were similar in 
size to but of greater intensity than bands when 
analyzing extracts of B. subtilis cells that synthesize 
P 31 and <7 E (Jonas et al., 1989). Densitometry measurements 
of radioactively labeled antibody bound to the Western 
blots revealed that the amount of the antibody fixed to 
the two major bands seen in the sigEA84 allele encodes a 
protein with the apparent molecular weight of a E but the 
sigEA84 product accumulated to a level only 10 to 20% of 
that obtained from the wild-type sigE allele in E. coli. 

EXAMPLE 2 

gffect of the sipEA84 allele on B . subtilis 

JH642 was transformed with pJMA84. Cells that had 
integrated this nonreplicating plasmid into their 
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chromosomes were selected by chloramphenicol resistance. 
Recombination between pJMA84 and its homologous sequences 
on the B. subtills chromosome could occur either upstream 
or downstream of the A84 deletion. Recombination 
5 upstream of the deletion would place the mutant allele 

under the control of the spoIIG promoter, allowing it to 
be expressed. A recombination event downstream of the 
mutation would leave a wild-type allele as the expressed 
copy. The A84 deletion lay approximately 200 bp from the 
10 upstream end of the 1.02-kpb fragment. It was 
anticipated that approximately 20% of the 
chloramphenicol-resistant transformants should have the 
mutant allele as the expressed copy under conditions of 
random recombination. 



Of the 497 transformants receiving pJMA84, 127 (25%) 
were Spo" (i.e. , colonies remained white and began to lyse 
after 24 h at 37°C on DS medium) . In a similar 
experiment in which the transforming DNA was pJM102 with 
20 a wild-type sigE allele (PJME-l) , no Spo" transformants 

were observed (O of 463) . Southern blot analysis of four 
Spo- and four Spo" clones which had been transformed with 
PJMA84 revealed the wild-type sigE allele as part of the 
spoUG operon in the Spo + clones and the sigEA84 allele 
25 as the operon component in the Spo" clones. 

The sigB gene of JH642 was replaced with sigEA84 by 
transforming JH642 (Phe) with SMY chromosomal DNA (Phe + ) 
plus plasmid pJMA84. Phe + transformants were screened 

30 for Spo" clones. The chromosomal DNAs were then examined 
by Southern blot analysis. These DNAs contained a 
PstJ-generated fragment of 1.03 kbp rather than the 
parental fragment of 1.1 kpb, which hybridized to a 
sigE-specific DNA probe. One of these clones (SEA84-1) 

35 was characterized further. 
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The activity of the a E in clone a B A84-l was analyzed 
by using a reporter gene (lacZ) fused to a (^-dependent 
promoter (spoIID) (Rong et al., 1986). SEA84-1 carrying 
a spoIID: :lacZ fusion made virtually no 0-galactosidase 
5 during growth or sporulation in DS medium. Western blot 
analysis failed to detect any o* at times when a E is 
normally abundant. No sigma-type protein was detected in 
B. suJbtilis when an attempt was made to visualize its 
product by immunoprecipitation of a radiolabeled product 
10 or induction in B. suJbtilis from an spac promoter. Both 
of these methods generate readily detectable P 31 and a E 
from the wild-type sigrE allele. 



EXAMPLE 3 

15 

Translational Fusions to sicrE 



A translation fusion between sigE alleles and the JS. 
coll lacZ gene was constructed. 

20 

As described in the Materials and Methods section, a 
340-bp fragment containing upstream DNA and the coding 
sequence for the first 55 amino acids of P 31 was cloned 
into a plasmid vector (pSGMU3lA2) so as to create an 

25 in-frame fusion between P 31 and the E. coll lacZ gene. A 
similar fusion was created by using the analogous DNA 
fragment (256 bp) from sigEA84. The coding sequence for 
the first 26 amino acids from the sigEA84 gene was fused 
to lacZ. The vector used was incapable of replication in 

30 B. suJbtilis but carried an antibiotic resistance gene 

(cat) expressible in B. subtilis. Thus, transformation 
of B. subtilis with the fusion plasmids followed by 
selection for chloramphenicol resistance yielded clones 
in which the plasmid was integrated into its chromosomal 

35 homolog. Integration of the plasmid resulted in a Spo* 
phenotype owing to the positioning of the sigE: :IacZ 
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fusion gene within the spoIIG operon in place of the 
wild-type allele. 

When the wild-type sigE: :lacZ fusion plasmid was 
5 introduced into a sigE* B. subtilis strain, all of the 
clones (38 of 38) displayed a Lac + phenotype on plates 
(Table 1). in a similar experiment, only 25% (8 of 32) 
of the transformants receiving the sigEA84: :2acZ DNA were 
Lac + . As was the case in the previous transformation 

10 experiment, the integration event of : :2acZ into sigE 

could take place either upstream or downstream of the A84 
mutation. Recombination downstream of the mutation would 
result in creation of a wild-type sigE::IacZ gene. It 
was therefore possible that the Lac + clones represented 

15 sigE+::2acZ fusions and that sigEA84: :2acZ clones were 
Lac". A. Southern blot analysis of the chromosomes of a 
representative sample of Lac* and Lac" transformants that 
had received sigEA84::lacZ DNA was performed. Six of six 
Lac + clones had a DNA fragment of the size anticipated 

20 for wild-type sigE upstream of the lacZ gene, and six of 
six of the Lac- clones had a fragment of the size expected 
for sigEA84 upstream of lacZ. 

To verify that a Lac" phenotype is a consequence of 
the sigEA84 sequence being upstream of the lacZ gene the 
transformation into SE84-1, a strain which already 
carries the sigEA84 allele was repeated. In these 
experiments, where the creation of a wild-type 
sigEA84::lacZ fusion was impossible, all of the 
chloramphenicol-resistant transformants that received the 
sigEA84::IacZ DNA were Lac" (25 or 25), while 
approximately 15% (4 of 27) of the transformants which 
received the sigEA84: :lacZ DNA were Lacr. Analysis of the 
level of B-galactosidase in extracts prepared from 
35 s igEA84::lacZ transformants that were phenotypically Lac* 
on plates failed to reveal measurable levels of 
B-galactosidase. Thus, synthesis of the fusion protein 



25 



30 
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mirrors the inability of sigEA84 to form a* itself. In 
both cases, no product was detectable by the assay 
systems employed. 

TABLE 1 

Transformation of B. subtilis by sigE::IacZ DNAs* 



No. of colonies 



10 Recipient Donor DNA 



fusion Total Spo' Spo" Lac" Lac- 
Examined 



JH642 sigE: :2acZ 
15 sigEA84 : : lacZ 

SE84 sigEnlacZ 

si grEA8 4: ilacZ 



38 


38 


0 


38 


0 


32 


32 


0 


8 


24 


27 


27 


0 


23 


4 


25 


25 


0 


0 


25 



■ competent recipient cells were transformed with plasmid 
20 DNA. Transformants were selected on LB plates 

supplemented with chloramphenicol (5 /*gml) . Cm r clones 
were replica plated onto DS plates (22) with and 
without an agar overlay containing X-gal 
(5-bromo-4-chloro-3-indolyl-B-/d- 
25 thiogalactopyranoside) (0.5 mg/ml) (18) . Spo + colonies 

turned brown after 24 hr at 37 °C on DS plates, while 
Spo' colonies remained white. Lac + colonies turned blue 
on DS with X-gal by 18 hr at 37°C. 

30 

The cloned sigE gene was mutagenized by using an 
oligonucleotide (30-mer) that would generate an altered 
sigEA84 with a Lys codon (AAA) between the initiation 
35 codon (AUG) and the codon specifying the a* amino terminus 
(GGC) . Mutant clones were isolated, and their structures 
were verified by DNA sequencing. 



40 



When introduced into B. subtilis, sigEA81 behaved 
identically to sigEA84. It conferred a Spo" phenotype on 
cells which carried it (i.e., when sigEA84 was cloned 
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25 



30 
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into PJM102 and transformed into JH642, approximately 20% 
of the chloramphenicol-resistant transf ormants were Spo) , 
and cells into which the sigEASl allele had been 
confessed synthesized no detectable a E A81 protein. Thus, 
the addition of AAA. at position two of sigEABl yielded no 
measurable increase in a* levels. 

During the course of the mutagenesis protocol a sigE 
mutant was generated (sigEA48) that encoded a protein 
with the four amino acids of the amino terminus of 
sigEASl joined by an Arg residue to residue 22 of P 31 . 
The sigEA48 mutation was initially identified by the 
intermediate size of the PstI DNA fragment which carried 
it. Its structure was determined by DNA sequencing. 

When sigEA48 was moved into B. subtilis, the cells 
became Spo". However, unlike sigEA84 and sigEASl, sigEA48 
synthesized an active a E -like protein. The sigEA48 
product was seen in Western blots, and its activity was 
evident by the activation of the spoIID promoter. The 
activation of the spoIID promoter by sigEA48 does not 
apparently require the processing of 0**48 into the 
wild-type a* protein. Transcription of spoIID occurred in 
B. subtilis strains carrying sigEA48 even if the strain 
had a second mutation in a gene (spoIIGA) that is 
essential for processing. In addition, no change was 
detected in the mobility of <x E A48 at times in sporulation 
(t<) when the processing activity should be present. 
Thus, 0^48 is both insensitive to processing and active 
without processing. The level of product present in the 
sigEA48 strain was comparable to that found in the 
wild-type sigB strain. 

Extracts were produced from B. subtilis strains lacking 
35 a chromosomal copy of sigE but containing plasmid-bome 
copies of either the wild-type (sigE + ) or one of the 
mutant alleles of sigE (pSR51 or pSR54) . Western blot 
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analysis for proteins of the size of either P 31 or ^-sized 
proteins in this system indicated no detectable proteins 
in the extracts prepared from the strains with the mutant 
sigE alleles. This result was the same obtained with the 
5 sigEA84 and sigEA81 alleles. 

EXAMPLE 4 

p-aalactosjHase Fus ion Proteins 

L0 

B. subtilis strains expressing 0-galactosidase fusion 
proteins were constructed as described (Jonas et al. # 
1990) . The E. coli lacZ gene was translationally coupled 
to sigE by cloning a 340-bp Pstl-SauIIIAl fragment of the 

L5 1.1 kbp cloned sigE gene into Pstl-BamHI-cut pSGNU31A2. 
This coupled the 5' end of lacZ in frame to sigE at a 
site 165 bp downstream of the sequence encoding the amino 
terminus of P* 1 . In a similar fashion, the bp segment 
encoding the polypeptide having the amino acid sequence 

20 of Figure 1 was cloned into this vector. This construct 
was formed at the same site within sigE but had only the 
base pairs shown in Figure 2 of the sigE protein-coding 
region upstream of lacZ. JH642 was transformed with 
PS6MD31A2 containing the coding sequence for either the 

25 sigE + or sigEA8 4 -encoded amino terminus. Clones in which 
the plasmid had integrated into the B. subtilis 
chromosome were selected by chloroamphenicol resistance 
(2.5 itg/ml) and screened by Southern blot analysis for 
integration into the sigE locus with the proper sigE 

30 allele positioned upstream of lacZ. 

The half-life of the fusion proteins expressed from 
transformed Escherichia coli or B. suhtilis host cells 
were measured. Table 2 shows the results. 



35 
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Protein 
Sigma E 

0-galactosidase 
0-galactosidase 

Sigma E:SS' 
0-galactosidase: SS 1 

0-galactosidase : SS 1 



-20- 

TABLE 2 

Microorganism 

E. coli 
E. coli 
B. subtilis 
E. coli 
E. coli 
B. subtilis 



ND 
30 min 
ND 

>2 hr 
>2 hr 



i SS is the 29AA sequence shown in Figure 1. 
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25 



30 



35 



e^^m vr*~~*»+iMnm in Protein Stabilisation 
Sequence 

The effect of perturbation of a-helical structure of 
the protein stabilization sequence shown in Figure 1 was 
examined by altering the primary amino acid sequence. 

The DNA sequence shown in Figure 2 was altered by 
insertion of an arginine codon at position 10, resulting 
in a new sequence encoding a 30-amino acid polypeptide, 
in order to effect the insertion, a restriction site was 
created at codon position 11 by substituting codon CTA 
for CTC. The base pair segment coding for the 30-amino 
acid polypeptide and a gene segment encoding 0- 
galactosidase were cloned. Expression of 0-galactosidase 
from a B. subtilis bacterial host cell was examined by 
Western blot. Amounts of expressed fusion polypeptide 
were less than measured in systems utilizing the 29-amino 
acid stabilization sequence. 
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The hydrophobic and hydrophilic faces of the a-helix 
formed by the 30-amino acid polypeptide were twisted 
relative to each other by about 100°. 

5 Additional insertions into the DNA sequence encoding 

the 30 amino acid polypeptide were made. Codons CTC and 
GCC encoding leucine and aialine were inserted between 
codon 10 and codon 11 of the base sequence, resulting in 
a 180° twist of the polypeptide a-helix structure as 

10 compared with the a-helix of the 29 amino acid 

polypeptide structure. The new 32 amino acid polypeptide 
was expressed as a fusion polypeptide with 0- 
galactosidase from a transformed B. subtilis host. 
Expression was less efficient than with fusions formed 

15 from either the 29 or the 30 amino acid protein 

stabilizing sequence, as indicated by Western blot 
analysis. 
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CIAIMS 

1. A protein stabilization sequence comprising an amino 
acid sequence attached to a protein to be stabilized, 
said sequence including a number of amino acid residues 
sufficient to form a structure having an outwardly 
directed hydrophobic face and a positively charged polar 
face. 



10 



2. The protein stabilization sequence of claim 1 
wherein the structure is an o-helix. 



15 3. The protein stabilization sequence of claim 1 having 
up to 100 amino acid residues. 



A. The protein stabilization sequence of claim 1 
20 wherein the outwardly directed hydrophobic face is in 
alignment with the positively charged face. 



5. The protein stabilization sequence of claim 1 
25 wherein the amino acid sequence is from about 10 to 
about 50 amino acid residues. 



6. The protein stabilization sequence of claim 1 
30 wherein the amino acid sequence is 29 amino acid 
residues. 



35 



7. The protein stabilization sequence of claim 6 
wherein the sequence comprises the sequence of Figure 1 
or a biologically functional equivalent thereof. 
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8. A method of stabilizing proteins comprising 
attaching the protein stabilization sequence of claim 1 
to a proteolytically sensitive protein. 



5 



10 



15 



20 



9. The method of claim 8 wherein the attaching is to an 
amino terminal end of a proteolytically sensitive 
protein. 

10. The method of claim 8 wherein the attaching is to a 
carboxy terminal end of a proteolytically sensitive 
protein. 

11. The method of claim 8 wherein the attaching is to a 
proteolytically sensitive protein at both terminii. 

12. The method of claim 8 wherein the proteolytically 
sensitive protein is RNA polymerase sigma factor. 



13. The method of claim 12 wherein the KNA polymerase 
25 sigma factor is from B. snbtilis or E. coll. 



14. A method of enhancing stability of recombinant 
proteins comprising 

30 

obtaining a gene segment encoding the protein 
stabilization sequence of claim 1; 



35 



preparing a recombinant vector having a gene 

encoding the protein stabilization sequence and 
a gene encoding a protein to be stabilized; 



10 
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transforming a host cell with the recombinant 
vector; 

selecting transformed host cell colonies; and 

facilitating expression of a heterologous protein 
fused N- terminally with the protein 
stabilization sequence. 



15. The method of claim 14 wherein the protein to be 
stabilized is 0-galactosidase. 



15 16. The method of claim 14 wherein the host cell is a 
bacterium. 

17. The method of claim 14 wherein the host cell is B. 
20 suJbtilis or E. coli. 



18* A system for production of a stable , heterologous 
protein in prokaryotic hosts comprising DNA having a 
25 coding sequence for the protein stabilization sequence of 
claim 1 operably linked to and out of reading frame with 
a terminal leader DNA sequence encoding a protein 
expressible as an active fusion protein. 

30 

19. A DNA segment which codes for the protein 
stabilization sequence of claim 1 or a functionally 
stabilizing equivalent thereof. 



35 
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20. 0!he DMA segment of claim 19 having the base pair 
sequence of Figure 2 or a functionally stabilizing 
structural equivalent thereof. 



21. A recombinant DNA vector comprising a first base 
sequence coding for the protein stabilization sequence of 
claim 1 or a second base sequence complementary to the 
first base sequence. 



WO 93/03156 



PCT/US92/05810 



1/3 



04 

U 



o 

u 



3 

Q) H 
d H 



CO 



to 

•H O 



CO 



3 
►3 



3 
Q) 
h! 



H O 



3 
0) 
h! 



3 



10 



CD 

Li- 



ra 
Hi 



4J 
0) 



0) 

H CM 



3 

0) 
h1 



3 
0) 

Hi 



U 



CO 



o 

u 



3 

a) in 
h1 h 



(0 



(0 



id 
> 



2 



u 

Eh 



3 



WO 93/03156 



PCT/US92/05810 



0 
0 
Eh 



0 



O 
Eh 
U 



Eh 



o 

u h 



0 
o 



0 

Eh 
O 



0 
H 
Eh 



0 

0 O 
0 oj 



2/3 



0 
0 

u 



< 

Eh 
Eh 



Eh 
U 



CNJ 
CD 



0 
Eh 



< 



0 
Eh 
O 



0 
Eh 
U 



0 
u 



3 

Eh 



3 



U 
Eh 
0 



0 
Eh 



Eh 



2 

0 



WO 93/03156 



PCT/US92/05810 



3/3 



POLAR 




FIG. 3 



I NTERNATIONAL SEARCH REPORT 

International Application No 



[~ r-ULSSIFI CATION OF SU BJEC1 M ATTHt Qf several dassificatio, symbol, apply, i^* H)' 

llnt.Cl. 5 C12N15/62; C12N9/38, 



C12N15/75; 



PCT/US 92/05810 



C07K7/10 



P. FIELDS SEARCHED 
Classification Systea 

llnt.Cl. 5 



C12N ; 



Minimum Documentation S earched* 

Classification Symbols 

C07K 



Docnmentation Searched 
to the Extent that such Documents 



fkai. Minimal* Documentation 
are Included in the Fields Searched 1 



-fr nnniM^TS CONSIDERED TO BE RELEVANT* 

u with indication, nhere appropriate, of the relevant 



Category 0 



Citation of Document, » 

J. BACTERIOLOGY 

vol. 172, no. 8, August 1990, AM. SOC. 
MICROBIOL., BALTIMORE, US; 

R a M 6S J0NAS ET AL. 'Phenotypes of Bacillus 
subtilis mutants altered in the 
precursor-specific region of Sigma E 
see page 4179, right column, line 14 

seepage 4181, right column, line 27 - 

seepage 4185, right column, line 41 - 
line 47 . 



Rclerart to Claim No. u 

1-7, 
19-21 



8-18 



-/- 



.10 



> Special categories irf died documents: 

«itt«r doornail bat published on or after tto intcraaaooal 

filing date 

Sitton or other special reason (as specified) 
-O* doeal> ^referi^ 



»t» fatter document published after the l?^^^ 0 * 
T ^rS^e'and not to conflict ^^^^f 
cited to understand the principle or theory underlying tne 



-X- document of particular relevance; the ^i™^^? 1 
cMmrttecoSdered novd or caiinot be considered to 

involve an inventive step 



•p* document published prior to the mternafional filing date but 
later than ^ priority date claimed 

J IV. CE RTIFICATION 

1 1 Date of the Actual Completion of the International Search 

06 OCTOBER 1992 

International Searching Authority 

EUROPEAN PATENT OFFICE 



*Y* doamentof pardaiJarreleva^ 

cannot be conrid ered to farvolve an ^ventWe st^ irh«A e 
£Snent Is combined with one or more ^othersnch doco- 
^^^coinl^mationbemgobviow 

in the art. 

document member of the same patent family 



Data of Mailing of this Intentatiimal Search Report 

28. ^1 32 

Signature of Authorized Officer 

H0RNIG H. 



Sol 1 : 52, 11 March 1988, CELL PRESS, 
CAMBRIDGE, NA.; 

spoliation sigma f actor in Bacillus 
subtilis: How morphological structure 
coulad control gene expression 
see page 698, right column, line 1 page 
700, right column, line zs_ 

SriTo 10 !! February 1989, NATURE 
AMERICA INC, NEW YORK, US 
^SPELEBUST ET AL. 'Different approaches 
J* SSmie a recombinant fusion protein 
<L otae 167 left column, line 6 - page 
S lol, P U?t column, line 3;_figures 1-5 

EP A 0 098 118 (BERMAN , SILHAVY , WEINSTOCK) 

seltagH, ffi M - page 7, line 5; 
claims 1-18 

MICROBIOL. , BALTIMORE, US; 
uTSetMs IUAND W.G. HALDENWANG . 

SpoIISA, a protein "f"* 1 !^^ 
pro-slpa E processing i« Bacillus 

SUbt il™7a24 left colunn, line 38 - page 

Une figure 1 

iol^rk Jul, 1992, AM. SOC. 
MICROBIOL., BALTIMORE, US; 

FffflS in B AL. -Mutational analysis 
KeTrtcursor-speclfic region of 

Bacmus subtnu ««- ^ 1ine a . 

^635 H^coluSn, Hie 15; figures 



page 
1,5 



